🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🏠 Local LLM Deployment

Model Optimization, GPU Acceleration, Inference, Privacy

Using Self-Hosted Large Language Models (LLMs) Securely in Government
digitaltrade.blog.gov.uk·6h·
Discuss: Hacker News
🖥️Self-hosted apps
Evaluation of Large Language Model-Driven AutoML in Data and Model Management from Human-Centered Perspective
arxiv.org·16h
🗃️SQLite
Context Kills VRAM: How to Run LLMs on consumer GPUs | by Lyx | May, 2025 | Medium
medium.com·2d
🗃️SQLite
Code vs LLM in a simple planning poker agent example
dev.to·3h·
Discuss: DEV
🗃️SQLite
Show HN: LocalCloud – Run complete AI stack locally for $0
github.com·1d·
Discuss: Hacker News
🖥️Self-hosted apps
Helix Parallelism: Sharding Strategies for Multi-Million-Token LLM Decoding
research.nvidia.com·1h·
Discuss: Hacker News
🗃️SQLite
T5Gemma: A new collection of encoder-decoder Gemma models
developers.googleblog.com·4h·
Discuss: Hacker News
🗃️SQLite
How I use LLMs to learn new subjects
seangoedecke.com·1d
🧠Personal Knowledge Base
Why is RL important, especially for LLMs?
opipe.notion.site·20h·
Discuss: Hacker News
🖥️Self-hosted apps
[P] Pruning Benchmarks for computer vision models
reddit.com·16h·
Discuss: r/MachineLearning
🗃️SQLite
We solved AI API interoperability
supermemory.ai·1d·
Discuss: Hacker News
🖥️Self-hosted apps
Automating Enterprise Applications with Implementation of LLM Frameworks
blog.devops.dev·7h
🖥️Self-hosted apps
A language model built for the public good
ethz.ch·15h·
Discuss: Hacker News
🖥️Self-hosted apps
CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations
arxiv.org·16h
🖥️Self-hosted apps
The Evolution of AI Job Orchestration. Part 1: Running AI Jobs on GPU Neoclouds
blog.skypilot.co·1d·
Discuss: Hacker News
🖥️Self-hosted apps
(Attempting to) Engineer the chaos out of AI agents
trunk.io·36m·
Discuss: Hacker News
🖥️Self-hosted apps
Introducing Phi-4-mini-flash-reasoning
azure.microsoft.com·3h·
Discuss: Hacker News
🖥️Self-hosted apps
Beyond the Prototype: 15 Hard-Earned Lessons to Ship Production-Ready AI Agents
hackernoon.com·23h
🖥️Self-hosted apps
LLM Explained, The Technology Behind ChatGPT
dev.to·4h·
Discuss: DEV
🖥️Self-hosted apps
Torpor: GPU-Enabled Serverless Computing for Low-Latency, Resource-Efficient Inference
arxiv.org·16h
🗃️SQLite
Loading...Loading more...
AboutBlogChangelogRoadmap